Figure 1: 161P2F10B SSH sequence of 182 nucleotides 



1 GATCACACAT TAGGTTATNG ACTTCAATAT 

61 TAAAACTGGG TCCATGTGCC AAGAAAGATA 

121 CCATGGTTGC CTCCTCCACA ATTTGTATTT 

181 TC 



TTTCAAATGG TTCAACTTCA GTCTTCTCTT 
GCCTCCATGC TCCTAAACTC ATTGTTATAA 
GATTTACTCC TAACAGCCAG CCACTGTTGA 
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Figure 2. 

Figure 2A. The cDNA (SEQ ID. NO. : ) and amino acid sequence (SEQ ED. NO. : ) of 

161P2F10B. The 3858 nucleotide sequence of 161P2F10B is shown. The codon for the start methionine is 
underlined. The open reading frame extends from nucleic acid 44-2671 including the stop codon. 

1 M E S T L T 

1 ctactttattctgataaaacaggtctatgcagctaccaggacaATGGAATCTACGTTGAC 
7LATEQPVKKNTLKKYKIACI 
6 1 TTTAGCAACGGAACAACCTGTTAAGAAGAACACTCTTAAGAAATATAAAATAGCTTGCAT 
27VLLALLVIMSLGLGLGLGLR 
121 TGTTCTTCTTGCTTTGCTGGTGATCATGTCACTTGGATTAGGCCTGGGGCTTGGACTCAG 
47 KLEKQGSCRKKCFDASFRGL 
U 181 GAAACTGGAAAAGCAAGGCAGCTGCAGGAAGAAGTGCTTTGATGCATCATTTAGAGGACT 

Q 67ENCRCDVACKDRGDCCWDFE 
y 241 GGAGAACTGCCGGTGTGATGTGGCATGTAAAGACCGAGGTGATTGCTGCTGGGATTTTGA 

j* 87DTCVESTRIWMCNKFRCGET 
03 3 01 AGACACCTGTGTGGAATCAACTCGAATATGGATGTGCAATAAATTTCGTTGTGGAGAGAC 

M 107 RLEASLCSCSDDCLQKKDCC 

361 CAGATTAGAGGCCAGCCTTTGCTCTTGTTCAGATGACTGTTTGCAGAAGAAAGATTGCTG 
H s 127 ADYKSVCQGETSWLEENCDT 

1^ 421 TGCTGACTATAAGAGTGTTTGCCAAGGAGAAACCTCATGGCTGGAAGAAAACTGTGACAC 

Q 147 AQQSQCPEGFDLPPVILFSM 

^ 4 81 AGCCCAGCAGTCTCAGTGCCCAGAAGGGTTTGACCTGCCACCAGTTATCTTGTTTTCTAT 

167 DGFRAEYLYTWDTLMPNINK 
541 GGATGGATTTAGAGCTGAATATTTATACACATGGGATACTTTAATGCCAAATATCAATAA 
187 LKTCGIHSKYMRAMYPTKTF 
6 01 ACTGAAAACATGTGGAATTCATTCAAAATACATGAGAGCTATGTATCCTACCAAAACCTT 
207 PNHYTIVTGLYPESHGIIDN 
661 CC C AAATCATTAC ACCATTGTCACGGGCTTGTATC CAGAGTCAC ATGGCAT CATTGACAA 
227 NMYDVNLNKNFSLSSKEQNN 
721 TAATATGTATGATGTAAATCTCAACAAGAATTTTTCACTTTCTTCAAAGGA^.CAAAATAA 
247 PAWWHGQPMWLTAMYQGLKA 
781 TCCAGCCTGGTGGCATGGGCAACCAATGTGGCTGACAGCAATGTATCAAGGTTTAAAAGC 
267 ATYFWPGSEVAINGSFPSIY 
841 CGCTACCTACTTTTGGCCCGGATCAGAAGTGGCTATAAATGGCTCCTTTCCTTCCATATA 
287 MPYNGSVPFEERISTLLKWL 
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901 CATGCCTTACAA.CGGAAGTGTCCCATTTGAAGAGAGGATTTCTACACTGTTAAAATGGCT 
307 DLPKAERPRFYTMYFEE PDS 
961 GGACCTGCCCAAAGCTGAAAGACCCAGGTTTTATACCATGTATTTTGAAGAAGCTGATTC 
327 SGHAGGPVSARVI KALQtfVD 

1021 CTCTGGACATGCAGGTGGACCAGTCAGTGCCAGAGTAATTAAAGCCTTACAGGTAGTAGA 
347 HAFGMLMEGLKQRNLHNCVN 

1081 TCATGCTTTTGGGATGTTGATGGAAGGCCTGAAGCAGCGGAATTTGCACAACTGTGTCAA 
367 I ILLADHGMDQTYCNKMEYM 

114 1 TATCATCCTTCTGGCTGACCATGGAATGGACCAGACTTATTGTAACAAGATGGAATACAT 
387 TDYFPRINFFYMYEGPAPRI 

1201 GACTGATTATTTTCCCAGAATAAACTTCTTCTACATGTACGAAGGGCCTGCCCCCCGCAT 
407 RAHNIPHDFFSFNSEEIVRN 

12 61 CCGAGCTCATAATATACCTCATGACTTTTTTAGTTTTAATTCTGAGGAAATTGTTAGAAA 
427 LSCRKPDQHFKPYLTPDLPK 

1321 CCTCAGTTGCCGAAAACCTGATCAGCATTTCAAGCCCTATTTGACTCCTGATTTGCCAAA 
447 RLHYAKNVRIDKVHLFVDQQ 

13 81 GCGACTGCACTATGCC AAGAACGTCAGAATCGACAAAGTTCATCTCTTTGTGGATCAACA 
467 WLAVRS KSNTNCGGGNHGYN 

1441 GTGGCTGGCTGTTAGGAGTAAATCAAATACAAATTGTGGAGGAGGCAACCATGGTTATAA 

487 NEFRSMEAIFLAHGPSFKEK 
15 01 CAATGAGTTTAGGAGCATGGAGGCTATCTTTCTGGCACATGGACCCAGTTTT AAAGAGAA 

507 TEVEPFENIEVYNLMCDLLR 
1561 GACTGAAGTTGAACCATTTGAAAATATTGAAGTCTATAACCTAATGTGTGATCTTCTACG 

527 IQPAPNNGTHGSLNHLLKVP 
1621 CATTCAACCAGCACCAAACAATGGAACCCATGGTAGTTTAAACCATCTTCTGAAGGTGCC 

547 FYEPSHAEEVSKFSVCGFAN 
1681 TTTTTATGAGCCATCCCATGCAGAGGAGGTGTCAAAGTTTTCTGTTTGTGGCTTTGCTAA 

567 PLPTESLDCFCPHLQNSTQL 
1741 TCCATTGCCCACAGAGTCTCTTGACTGTTTCTGCCCTCACCTACAAAATAG1ACTCAGCT 

587 EQVNQMLNLTQEEITATVKV 
18 01 GGAACAAGTGAATCAGATGCTAAATCTCACCCAAGAAGAAATAACAGCAACAGTGAAAGT 

607 NLPFGRPRVLQKNVDHCLLY 
1861 AAATTTGCCATTTGGGAGGCCTAGGGTACTGCAGAAGAACGTGGACCACTGTCTCCTTTA 

627 HREYVSGFGKAMRMPMWSSY 
1921 C CACAGGGAATATGTCAGTGGATTTGGAAAAGCTATGAGGATGCC CATGTGG AGTTCATA 

647 TVPQLGDTSPLPPTVPDCLR 
1981 CACAGTCCCCCAGTTGGGAGACACATCGCCTCTGCCTCCCACTGTCCCAGACTGTCTGCG 

667 ADVRVPPSESQKCS FYLADK 
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2 041 GGCTGATGTCAGGGTTCCTCCTTCTGAGAGCCAAAAATGTTCCTTCTATTTAGCAGACAA 

687 NITHGFLYPPASNRTSDSQY 
2101 GAATATCAC C CACGGCTTC C TCTATC CTC CTGCCAGC AATAGAACATC AGATAGC CAATA 

707 DALITSNLVPMYEEFRKMWD 
2161 TGATGCTTTAATTACTAGCAATTTGGTACCTATGTATGAAGAATTCAGAAA^ ATGTGGGA 

727 YFHSVLLIKHATERNGVNVV 
2221 CTACTTCCACAGTGTTCTTCTTATAAAACATGCCACAGAAAGAAATGGAGTPAATGTGGT 

747 SGPIFDYNYDGHFDAPDEIT 
22 81 TAGTGGACCAATATTTGATTATAATTATGATGGCCATTTTGATGCTCCAGATGAAATTAC 

767 KHLANTDVP I PTHYFVVLTS 
2 341 CAAACATTTAGCCAACACTGATGTTCCCATCCCAACACACTACTTTGTGGTGCTGACCAG 

787 CKNKSHTPENCPGWLDVLPF 
24 01 TTGTAAAAACAAGAGCCACACACCGGAAAACTGCCCTGGGTGGCTGGATGTCCTACCCTT 
y s 807 IIPHRPTNVESCPEGKPEAL 

O 2461 TATCATCCCTCACCGACCTACCAACGTGGAGAGCTGTCCTC^ 

□ 

827 WVEERFTAHIARVRDVELLT 

j|j 2521 TTGGGTTGAAGAAAGATTTACAGCTCACATTGCCCGGGTCCGTGATGTAGAACTTCTCAC 

£ 847 GLDFYQDKVQPVSEILQLKT 

g 2 5 81 TGGGCTTGACTTCTATCAGGATAAAGTGCAGCCTGTCTCTGAAATTTTGCAPlCTAAAGAC 

* 867 YLPTFETTI * 

; ;55S 2641 ATATTTACCAACATTTGAAACCACTATTTAActtaataatgtctacttaatatataattt 

r-i 27 01 actgtataaagtaattttggcaaaatataagtgattttttctggagaattgtaaaataaa 

HI 2 7 61 gttttctatttttccttaaaaaaaaaaccggaattccgggcttgggaggctgaggcagga 

r:" 2 821 gactcgcttgaacccgggaggcagaggttgcagtgagccaagattgcgccattgcactcc 

2 8 81 agagcctgggtgacagagcaagactacatctcaaaaaataaataaataaaataaaagtaa 
2941 caataaaaataaaaagaacagcagagagaatgagcaaggagaaatgtcacaaactattgc 

3 001 aaaatactgttacactgggttggctctccaagaagatactggaatctcttcagccatttg 
3 061 cttttcagaagtagaaaccagcaaaccacctctaagcggagaacatacgattctttatta 
3121 agtagctctggggaaggaaagaataaaagttgatagctccctgattgggaaaaaatgcac 
3181 aattaataaagaatgaagatgaaagaaagcatgcttatgttgtaacacaaaaaaaattca 
3241 caaacgttggtggaaggaaaacagtatagaaaacattactttaactaaaagctggaaaaa 
3301 ttttcagttgggatgcgactgacaaaaagaacgggatttccaggcataaagttggcgtga 
33 61 gctacagagggcaccatgtggctcagtggaagacccttcaagattcaaagttccatttga 
3 421 cagagcaaaggcacttcgcaaggagaagggtttaaattatgggtccaaaagccaagtggt 
3 4 81 aaagcgagcaatttgcagcataactgcttctcctagacagggctgagtgggcaaaatacg 
3541 acagtacacacagtgactattagccactgccagaaacaggctgaacagccctgggagaca 
3 601 agggaaggcaggtggtgggagttgttcatggagagaaaggagagttttagaaccagcaca 
3661 tccactggagatgctgggccaccagacccctcccagtcaataaagtctggtgcctcattt 
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3721 gatctcagcctcatcatgaccctggagagaccctgataccatctgccagtccccgacagc 
3781 ttaggcactccttgccatcaacctgaccccccgagtggttctccaggctccctgccccac 
3 841 ccattcaggccggaattc 

Figure 2B: The cDNA (SEQ ID. NO. : ) and amino acid sequence (SEQ ID. NO. : ) of 

161P2F10B variant 1. The 3858 nucleotide sequence of 161P2F10B variant 1 is shown. The start methionine is 
underlined. The open reading frame extends from nucleic acid 44-2671 including the stop codon. 

1 M E S T L T 

1 ctactttattctgataaaacaggtctatgcagctaccaggacaATGGAATCTACGTTGAC 
7LATEQPVKKNTLKKYKIAC 1. 

6 1 TTTAGCAACGGAACAACCTGTTAAGAAGAACACTCTTAAGAAATATAAAATAGCTTGCAT 

27VLLAL LV I MS LGLGLGLGLR 
121 TGTTCTTCTTGCTTTGCTGGTGATCATGTCACTTGGATTAGGCCTGGGGCTTGGACTCAG 

47 KLE KQG S CRKKC FDAS F R G L 
181 GAAACTGGAAAAGCAAGGCAGCTGCAGGAAGAAGTGCTTTGATGCATCATTTAGAGGACT 

67 ENCRCDVACKDRGDCCWDFE 
241 GGAGAACTGCCGGTGTGATGTGGCATGTAAAGACCGAGGTGATTGCTGCTGGGATTTTGA 

87 DTCVES TRIWMCNKFRCGE T 
301 AGACACCTGTGTGGAATCAACTCGAATATGGATGTGCAATAAATTTCGTTGTGGAGAGAC 
107 RLEASLCSCSDDCLQRKDCC 
3 61 CAGATTAGAGGCCAGCCTTTGCTCTTGTTCAGATGACTGTTTGCAGAGGAAAGATTGCTG 
127ADYKSVCQGETSWLEENCD" 
421 TGCTGACTATAAGAGTGTTTGCCAAGGAGAAACCTCATGGCTGGAAGAAAACTGTGACAC 
147 AQQSQCPEGFDLPPVILFSM 
481 AGCCCAGCAGTCTCAGTGCCCAGAAGGGTTTGACCTGCCACCAGTTATCTTGTTTTCTAT 
167 DGFRAEYLYTWDTLMPNINK 
541 GGATGGATTTAGAGCTGAATATTTATACACATGGGATACTTTAATGCCAAATATCAATiUV 
187 LKTCGIHSKYMRAMYPTKTF 
601 ACTGAAAACATGTGGAATTCATTCAAAATACATGAGAGCTATGTATCCTACCAAAACCTT 
207 PNHYTIVTGLYPESHGI IDN 
661 CCCAAATCATTACACCATTGTCACGGGCTTGTATCCAGAGTCACATGGCATCATTGACi^A 
227 NMYDVNLNKNFSLS SKEQNN 
721 TAATATGTATGATGTAAATCTCAACAAGAATTTTTCACTTTCTTCAAAGGAACAAAATi^A 
247 PAWWHGQ PMWLTAMYQGLKA 
781 TCCAGCCTGGTGGCATGGGCAACCAATGTGGCTGACAGCAATGTATCAAGGTTTAAAAGC 
267 ATYFWPGSEVAINGSFPSIY 
841 CGCTACCTACTTTTGGCCCGGATCAGAAGTGGCTATAAATGGCTCCTTTCCTTCCATATA 
287 MPYNGSVPFEERISTLLKWL 
901 CAT GC C TT AC AACGGAAGTGTCCC AT TTGAAGAGAGGATT T CTAC ACTGT T AAAATGG CT 
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307 DLPKAERPRFYTMYFEEPDS 
961 GGACCTGCCCAAAGCTGAAAGACCCAGGTTTTATACCATGTATTTTGAAGAACCTGATTC 
327 SGHAGGPVSARVI KALQVVD 

1021 CTCTGGACATGCAGGTGGACCAGTCAGTGCCAGAGTAATTAAAGCCTTACAGGTAGTAG A 
347 HAFGMLMEGLKQRNLHNCVN 

1081 TCATGCTTTTGGGATGTTGATGGAAGGCCTGAAGCAGCGGAATTTGCACAACTGTGTCAA 

3 67 I ILLADHGMDQTYCNKMEYK 
1141 TATCATCCTTCTGGCTGACCATGGAATGGACCAGACTTATTGTAACAAGATGGAATAC^.T 

387 TDYFPRINFFYMYEGPAPRI 

12 01 GACTGATTATTTTCCCAGAATAAACTTCTTCTACATGTACGAAGGGCCTGCCCCCCGCAT 

4 07 R A H N I PHDFFSFNSEEIVRN 

427 LSCRKPDQHFKPYLTPDLPK 
1321 CCTCAGTTGCCGAAAACCTGATCAGCATTTCAAGCCCTATTTGACTCCTGATTTGCCA?iA 
447 RLHYAKNVRI DKVHLFVDQQ 

13 81 gcgactgcactatgccaagaacgtcagaatcgacaaagttcatctctttgtggatcaaca 

I'll 

n 4 67 WLAVRS KSNTNCGGGNHGYN 

Q 1441 GTGGCTGGCTGTTAGGAGTAAATCAAATACAAATTGTGGAGGAGGCAACCATGGTTAT^ 

ifi 487 NEFRSMEAI FLAHGPS FKEK 

"5 1501 CAATGAGTTTAGGAGCATGGAGGCTATCTTTCTGGCACATGGACCCAGTTTTAAAGAG^lA 

jSj 507 TEVEPFENI EVYNLMCDLLR 

1561 GACTGAAGTTGAACCATTTGAAAATATTGAAGTCTATAACCTAATGTGTGATCTTCTACG 

I s * 527 I QPAPNNGTHGSLNHLLKVP 

l V. 

1621 CATTCAACCAGCACCAAACAATGGAACCCATGGTAGTTTAAACCATCTTCTGAAGGTGCC 
J^! 547 FYEPSHAEEVSKFSVCGFAN 

f=k 1681 TTTTTATGAGCCATCCCATGCAGAGGAGGTGTCAAAGTTTTCTGTTTGTGGCTTTGCT/iA 

l=h 567 PLPTESLDCFCPHLQNSTQL 

1741 tccattgcccacagagtctcttgactgtttctgccctcacctacaaaatagtactcagc:t 
587 eqvnqmlnltqee i tatvkv 
1801 ggaacaagtgaatcagatgctaaatctcacccaagaagaaataacagcaacagtgaaagt 

607 NLPFGRPRVLQKNVDHCLLY 
1861 AAATTTGCCATTTGGGAGGCCTAGGGTACTGCAGAAGAAGGTGGACCACTGTCTCCTTTA 

627 HREYVSGFGKAMRMPMWSSY 
1921 CCACAGGGAATATGTCAGTGGATTTGGAAAAGCTATGAGGATGCCCATGTGGAGTTCATA 

647 TVPQLGDTSPLPPTVPDCLR 
1981 CACAGTCCCCCAGTTGGGAGACACATCGCCTCTGCCTCCCACTGTCCCAGACTGTCTGCG 

667 ADVRVPPSESQKCSFYLADIC 
2041 GGCTGATGTCAGGGTTCCTCCTTCTGAGAGCCAAAAATGTTCCTTCTATTTAGCAGACi^A 

6 87 NITHGFLYPPASNRTSDSQY 
2101 GAATATCACCCACGGCTTCCTCTATCCTCCTGCCAGCAATAGAACATCAGATAGCCAATA 

707 DALITSNLVPMYEEFRKMWD 
2161 TGATGCTTTAATTACTAGCAATTTGGTACCTATGTATGAAGAATTCAGAAAAATGTGGGA 
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727 YFHSVLLI KHATERNGVNVV 
2221 CTACTTCCACAGTGTTCTTCTTATAAAACATGCCACAGAAAGAAATGGAGTAAATGTGGT 

747 SGPIFDYNYDGHFDAPDEIT 
2281 TAGTGGACCAATATTTGATTATAATTATGATGGCCATTTTGATGCTCCAGATGAAATT/lC 

767 KHLANTDVPIPTHYFVVLTS 
2341 CAAACATTTAGCCAACACTGATGTTCCCATCCCAACACACTACTTTGTGGTGCTGACCAG 

787 CKNKSHTPENCPGWLDVLPF 
2401 TTGTAAAAACAAGAGCCACACACCGGAAAACTGCCCTGGGTGGCTGGATGTCCTACCCTT 

807 I I PHRPTNVESCPEGKPEAL 
2461 TATCATCCCTCACCGACCTACCAACGTGGAGAGCTGTCCTGAAGGTAAACCAGAAGCTCT 

827WVEERFTAHIARVRDVELL" 
2521 TTGGGTTGAAGAAAGATTTACAGCTCACATTGCCCGGGTCCGTGATGTAGAACTTCTCAC 

847 GLDFYQDKVQPVSEILQLK n : 
2581 TGGGCTTGACTTCTATCAGGATAAAGTGCAGCCTGTCTCTGAAATTTTGCAACTAAAGAC 

867 YLPTFETTI * 
;IT 2641 ATATTTACCAACATTTGAAACCACTATTTAActtaataatgtctacttaatatataattt 

2701 actgtataaagtaattttggcaaaatataagtgattttttctggagaattgtaaaataaa 
r| 2761 gttttctatttttccttaaaaaaaaaaccggaattccgggcttgggaggctgaggcagga 

1/1 2821 gactcgcttgaacccgggaggcagaggttgcagtgagccaagattgcgccattgcactcc 

\f* 28 81 agagcctgggtgacagagcaagactacatctcaaaaaataaataaataaaataaaagtaa 

2 941 caataaaaataaaaagaacagcagagagaatgagcaaggagaaatgtcacaaactattgc 

3 001 aaaatactgttacactgggttggctctccaagaagatactggaatctcttcagccatttg 
3061 cttttcagaagtagaaaccagcaaaccacctctaagcggagaacatacgattctttatta 
3121 agtagctctggggaaggaaagaataaaagttgatagctccctgattgggaaaaaatgcac 

r^t 3181 aattaataaagaatgaagatgaaagaaagcatgcttatgttgtaacacaaaaaaaattca 

is«% 3241 caaacgttggtggaaggaaaacagtatagaaaacattactttaactaaaagctggaaaaa 

y ;: 33 01 ttttcagttgggatgcgactgacaaaaagaacgggatttccaggcataaagttggcgtga 

33 61 gctacagagggcaccatgtggctcagtggaagacccttcaagattcaaagttccatttga 
3421 cagagcaaaggcacttcgcaaggagaagggtttaaattatgggtccaaaagccaagtggt 

34 81 aaagcgagcaatttgcagcataactgcttctcctagacagggctgagtgggcaaaatacg 
3541 acagtacacacagtgactattagccactgccagaaacaggctgaacagccctgggagaca 
3601 agggaaggcaggtggtgggagttgttcatggagagaaaggagagttttagaaccagcaca 
3661 tccactggagatgctgggccaccagacccctcccagtcaataaagtctggtgcctcattt 
3721 gatctcagcctcatcatgaccctggagagaccctgataccatctgccagtccccgacagc 
3781 ttaggcactccttgccatcaacctgaccccccgagtggttctccaggctccctgccccac 
3841 ccattcaggccggaattc 
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Figure 3A. Amino acid sequence of 161P2F10B (SEQ ID. NO. : ). The 161P2F10B protein has 875 amino 

acids. 



1 


MESTLTLATE 


QPVKKNTLKK 


YKIACIVLLA 


LLVIMSLGLG 


LGLGLRKLEK 


QGSCRKKCFD 


61 


ASFRGLENCR 


CDVACKDRGD 


CCWDFEDTCV 


ESTRIWMCNK 


FRCGETRLEA 


SLCSCSDDCL 


121 


QKKDCCADYK 


SVCQGETSWL 


EENCDTAQQS 


QCPEGFDLPP 


VILFSMDGFR 


AEYLYTWDTL 


181 


MPNINKLKTC 


GIHSKYMRAM 


YPTKTFPNHY 


TIVTGLYPES 


HGI IDNNMYD 


VNLNKNFSLS 


241 


SKEQNNPAWW 


HGQPMWLTAM 


YQGLKAATYF 


WPGSEVAING 


SFPSIYMPYN 


GSVPFEERIS 


301 


TLLKWLDLPK 


AERPRFYTMY 


FEEPDSSGHA 


GGPVSARVIK ALQWDHAFG 


MLMEGLKQRN 


361 


LHNCVNIILL 


ADHGMDQTYC 


NKMEYMTDYF 


PRINFFYMYE 


GPAPRIRAHN 


IPHDFFSFNS 


421 


EEIVRNLSCR 


KPDQHFKPYL 


TPDLPKRLHY 


AKNVRIDKVH 


LFVDQQWLAV 


RSKSNTNCGG 


481 


GNHGYNNEFR 


SMEAIFLAHG 


PSFKEKTEVE 


PFENIEVYNL 


MCDLLRIQPA 


PNNGTHGSLN 


541 


HLLKVPFYEP 


SHAEEVSKFS 


VCGFANPLPT 


ESLDCFCPHL 


QNSTQLEQVN 


QMLNLTQEEI 


601 


TATVKVNLPF 


GRPRVLQKNV 


DHCLLYHREY 


VSGFGKAMRM 


PMWSSYTVPQ 


LGDTSPLPPT 


661 


VPDCLRADVR 


VPPSESQKCS 


FYLADKNITH 


GFLYPPASNR TSDSQYDALI 


TSNLVPMYEE 


721 


FRKMWDYFHS 


VLLIKHATER 


NGVNWSGPI 


FDYNYDGHFD 


APDEITKHLA 


NTDVPIPTHY 


781 


FWLTSCKNK 


SHTPENCPGW 


LDVLPFIIPH 


RPTNVESCPE 


GKPEALWVEE 


RFTAHIARVR 


841 


DVELLTGLDF 


YQDKVQPVSE 


ILQLKTYLPT 


FETTI 







Figure 3B. Amino acid sequence of 161P2F10B variant 1 (SEQ ID. NO. : ). The 161P2F10B variant 1 

protein has 875 amino acids. 



1 


MESTLTLATE 


QPVKKNTLKK 


YKIACIVLLA 


LLVIMSLGLG 


LGLGLRKLEK 


QGSCRKKCFD 


61 


ASFRGLENCR 


CDVACKDRGD 


CCWDFEDTCV 


ESTRIWMCNK 


FRCGETRLEA 


SLCSCSDDCL 


121 


QRKDCCADYK 


SVCQGETSWL 


EENCDTAQQS 


QCPEGFDLPP 


VILFSMDGFR 


AEYLYTWDTL 


181 


MPNINKLKTC 


GIHSKYMRAM 


YPTKTFPNHY 


TIVTGLYPES 


HGI IDNNMYD 


VNLNKNFSLS 


241 


SKEQNNPAWW 


HGQPMWLTAM 


YQGLKAATYF 


WPGSEVAING 


SFPSIYMPYN 


GSVPFEERIS 


301 


TLLKWLDLPK 


AERPRFYTMY 


FEEPDSSGHA 


GGPVSARVIK 


ALQWDHAFG 


MLMEGLKQRN 


361 


LHNCVNIILL 


ADHGMDQTYC 


NKMEYMTDYF 


PRINFFYMYE 


GPAPRIRAHN 


IPHDFFSFNS 


421 


EEIVRNLSCR 


KPDQHFKPYL 


TPDLPKRLHY 


AKNVRIDKVH 


LFVDQQWLAV 


RSKSNTNCGG 


481 


GNHGYNNEFR 


SMEAIFLAHG 


PSFKEKTEVE 


PFENIEVYNL 


MCDLLRIQPA 


PNNGTHGSLN 


541 


HLLKVPFYEP 


SHAEEVSKFS 


VCGFANPLPT 


ESLDCFCPHL 


QNSTQLEQVN 


QMLNLTQEEI 


601 


TATVKVNLPF 


GRPRVLQKNV 


DHCLLYHREY 


VSGFGKAMRM 


PMWSSYTVPQ 


LGDTSPLPPT 


661 


VPDCLRADVR VPPSESQKCS 


FYLADKNITH 


GFLYPPASNR 


TSDSQYDALI 


TSNLVPMYEE 


721 


FRKMWDYFHS 


VLLIKHATER 


NGVNWSGPI 


FDYNYDGHFD 


APDEITKHLA 


NTDVPIPTHY 


781 


FWLTSCKNK 


SHTPENCPGW 


LDVLPFIIPH 


RPTNVESCPE 


GKPEALWVEE 


RFTAHIARVR 


841 


DVELLTGLDF 


YQDKVQPVSE 


ILQLKTYLPT 


FETTI 







Figure 4 
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Figure 4A. Amino acid alignment of 161P2F10B with ENPP3. 
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161P2F10B ldvlpf i iphrptnvescpegkpeal wveerf tahiarvr 840 

ENPP3 ldvlpf i iphrptnvescpegkpealwveerftahiarvr 840 

161P2F10B dvelltgldfyqdkvqpvseilqlktylptfetti 875 

ENPP3 dvelltgldfyqdkvqpvseilqlktylptfetti 875 



Figure 4b. Amino acid alignment of 161P2F10B with 161P2F10B variant 1. 

161P2F10B mestltlateqpvkkntlkkykiacLvllallvimslglg 40 

161P2F10B variant 1 mestltlateqpvkkntlkkykiacivllallvimslglg 40 

161P2F10B IglglrklekqgscrkkcfdasfrgLencrcdvackdrgd 80 

161P2F10B variant 1 Iglglrklekqgscrkkcfdasfrglencrcdvackdrgd 80 

161P2F10B ccwdfedtcvestriwmcnkfrcgetrleaslcscsddcl 12C 

161P2F10B variant 1 ccwdfedtcvestriwmcnkfrcgetrleaslcscsddcl 12C 

161P2F10B qH kdccadyksvcqgetswleencdtaqqsqcpegfdlpp 16C 

161P2F10B variant 1 gQ kdccadyksvcqgetswleencdtaqqsqcpegfdlpp 16C 

161P2F10B vilfsmdgfraeylytwdtlmpninklktcgihskymram 20C 

161P2F10B variant 1 vilfsmdgfraeylytwdtlmpninklktcgihskymram 20C 

161P2F10B yptktfpnhytivtglypeshgiidnnmydvnlnknfsls 24C 

161P2F10B variant 1 yptktfpnhytivtglypeshgiidnnmydvnlnknfsls 24C 

161P2F10B skeqnnpawwhgqpmwltamyqglkaatyfwpgsevaing 28C 

161P2F10B variant 1 skeqnnpawwhgqpmwltamyqglkaatyfwpgsevaing 28C 

161P2F10B sfpsiympyngsvpfeeristllkwldlpkaerprfytmy 32C 

161P2F10B variant 1 sfpsiympyngsvpfeeristllkwldlpkaerprfytmy 32C 

161P2F10B feepdssghaggpvsarvikalqvvdhafgmlmeglkqrn 36C 

161P2F10B variant 1 feepdssghaggpvsarvikalqvvdhafgmlmeglkqrn 36C 

161P2F10B lhncvni i 1 ladhgmdqtycnkmeymtdyfprinf fymye 40C 

161P2F10B variant 1 lhnrvniilladhgmdqtycnkmeymtdyfprinffymye 40C 

161P2F10B gpaprirahniphdffsfnseeivrnlscrkpdqhfkpyl 44C 

161P2F10B variant 1 gpaprirahniphdffsfnseeivrnlscrkpdqhfkpyl 44C 

161P2F10B tpdlpkrlhyaknvridkvhlfvdqqwlavrsksntncgg 48C 

161P2F10B variant 1 tpdlpkrlhyaknvridkvhlfvdqqwlavrsksntncgg 48C 

161P2F10B gnhgynnefrsmeaiflahgpsfkektevepfenievynl 52C 

161P2F10B variant 1 gnhgynnefrsmeaiflahgpsfkektevepfenievynl 52C 

161P2F10B mcdllriqpapnngthgslnhllkvpfyepshaeevskfs 56C 

161P2F10B variant 1 mcdllriqpapnngthgslnhllkvpfyepshaeevskfs 56C 

161P2F10B vcgfanplptesldcfcphlqnstqleqvnqmlnltqeei 60C 

161P2F10B variant 1 vcgfanplptesldcfcphlqnstqleqvnqmlnltqeei 60C 

161P2F10B tatvkvnlpfgrprvlqknvdhcllyhreyvsgfgkamrm 64C 

161P2F10B variant 1 tatvkvnlpfgrprvlqknvdhcllyhreyvsgfgkamrm 64C 

161P2F10B pmwssytvpqlgdtsplpptvpdclradvrvppsesqkcs 68C 

161P2F10B variant 1 pmwssytvpqlgdtsplpptvpdclradvrvppsesqkcs 68C 

161P2F10B fyladknithgflyppasnrtsdsqydalitsnlvpmyee 72C 

161P2F10B variant 1 fyladknithgflyppasnrtsdsqydalitsnlvpmyee 72C 
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161P2F10B frkmwdyfhsvllikhaterngvnvvsgpifdynydghfd 76C 
161P2F10B variant 1 frkmwdyfhsvllikhaterngvnvvsgpifdynydghfd 76C 
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4C) Alignment of 161P2F10B and SNP variant 2 carrying a T to P mutation at position 874. 

Query: 492 MEAIFLAHGPSFKEKTEVEPFENIEVYNLM 551 

MEAIFLAHGPSFKEKTEVEPFENIEVYNLMCDLLRIQPAPNNGTHGSLNHLLKVPFYEPS 
Sbjct: 1 MEAIFLAHGPSFKEKTEVEPFENIEVYNLMCDLLRIQPAP1WGTHGSLNHLLKVPFYEPS 60 

Query: 552 HAEEVSKFSVCGFANPLPTESLDCFCPHLQNSTQLEQVNQMLNLTQEEITATVKVNLPFG 611 

HAEEVSKJFSVCGFANPLPTESLDCFCPHLQNSTQLEQWQMLNLTQEEITAT T v^VNLPFG 
Sbjct: 61 HAEEVSKFSVCGFANPLPTESLDCFCPHLQNSTQL^ 120 

3 Query: 612 RPRVLQKNVDHCLLYHREWSGFGKAMRMPMWSSYTWQLGDT 

a 67i 

IFl RPRVLQKNVDHCLLYHREWSGFGKAMRMPMWSSYTVPQLGDTSPLPPTVPDC 

:JS Sbjct: 121 RPRVLQKNVDHCLLYHREYVSGFGKAMRMPMWSSYTWQLGDTSPLPPTW 180 

\B 

q Query: 672 PPSESQKCSFYLADKNITHGFLYPPASNRTSDSQYDALITSNLVPM YEEF'RKMWD YFHS V 73 1 

7 PPSESQKCSFYLADKMTHGFLYPPASNRTSDSQYDALITSNLVPMYEEFRKMWDYFHSV 

u Sbjct: 181 PPSESQKCSFYLADKNITHGFLYPPASNRTSDSQTO^ 240 

;!! Query: 732 LLIKHATERNGVNWSGPIFDYNY 791 

H LLIKHATERNGVNVVSGPIFDYNYDGHFDAPDEITKHLANTDVPIPTHYFVVLTSCKNKS 

^ Sbjct: 241 LLIKHATERNGVNWSGPIFDYNYD^ 300 

O 

M Query: 792 HTPENCPGWLDVLPFIIPHRPTNVESCPEGKPEA^ 85 1 
HTPENCPGWLDVLPFIIPHRPTNVESCPEGKPEALWVEERFTAHIARVRDVELLTGLDFY 
Sbjct: 301 HTPENCPGWLDVLPFIIPHRPTN^SCPEGKPEALWVEERFTAHIARVRDVELLTGLDFY 360 

Query: 852 QDKVQPVSEILQLKTYLPTFETp 875 

QDKVQPVSEILQLKTYLPTFET I 
Sbjct: 361 QDKVQPVSEILQLKTYLPTFETPI 384 
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Figure 5: 161 P2F10B Hydrophilicity profile 

(Hopp TP., Woods K.R., 1981. Proc. Natl. Acad. Sci. U.S.A. 78:3824-3828) 




Figure 6: 161P2F10B Hydropathicity Profile 

(Kyte J., Doolittle R.F., 1982. J. Mol. Biol. 157:105-132) 
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Figure 7: 161P2F10B % Accessible Residues Profile 

(JaninJ., 1979. Nature 277:491-492) 
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Figure 8: 161P2F10B Average Flexibility Profile 

(Bhaskaran R., Ponnuswamy P.K., 1988. 
Int. J. Pept. Protein Res. 32:242-255) 
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Figure 9: 161P2F10B Beta-turn Profile 

(Deleage, G., Roux B. 1987. Protein Engineering 1:289-294) 
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Figure 19A 
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ccchhhhhhhhhcccchhhhhhhhhhhhhhhhhehhccccccceeecccccccccccccch 

NTDVPIPTHYFWLTSCKNKSHTPENCPGWLDVLPFIIPHRPTNVESCPEGKPEALWVEERFTAHIARVR 

cccccccceeeeeeeecccccccccccccccccceeeecccccccccccccccchhhhhhhhhhhhhhhh 

DVELLTGLDFYQDKVQPVSEILQLKTYLPTFETTI 

hheehcccccchcchhchhhhhehhhhcccccccc 



c: random coil (31.31%) 

e: extended strand (11.31%) 

h: alpha helix (57.37%) 
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